Rooted Trees with Probabilities Revisited

نویسنده

Georg Böcherer

چکیده

Rooted trees with probabilities are convenient to represent a class of random processes with memory. They allow to describe and analyze variable length codes for data compression and distribution matching. In this work, the Leaf-Average Node-Sum Interchange Theorem (LANSIT) and the well-known applications to path length and leaf entropy are re-stated. The LANSIT is then applied to informational divergence. Next, the differential LANSIT is derived, which allows to write normalized functionals of leaf distributions as an average of functionals of branching distributions. Joint distributions of random variables and the corresponding conditional distributions are special cases of leaf distributions and branching distributions. Using the differential LANSIT, Pinsker’s inequality is formulated for rooted trees with probabilities, with an application to the approximation of product distributions. In particular, it is shown that if the normalized informational divergence of a distribution and a product distribution approaches zero, then the entropy rate approaches the entropy rate of the product distribution. 2 / 34 ar X iv :1 30 2. 07 53 v1 [ cs .I T ] 4 F eb 2 01 3 Probability notation I Random variable X , takes values in X I Distribution PX : for each a ∈ X : PX (a) := Pr(X = a). I Support suppPX := {a ∈ X : PX (a) > 0}.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The probabilities of trees and cladograms under Ford's $\alpha$-model

We give correct explicit formulas for the probabilities of rooted binary trees and cladograms under Ford’s α-model.

متن کامل

Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics.

The star-tree paradox refers to the conjecture that the posterior probabilities for the three unrooted trees for four species (or the three rooted trees for three species if the molecular clock is assumed) do not approach 1/3 when the data are generated using the star tree and when the amount of data approaches infinity. It reflects the more general phenomenon of high and presumably spurious po...

متن کامل

4-PLACEMENT OF ROOTED TREES

A tree T of order n is called k-placement if there are k edge-disjoint copies of T into K_{n}. In this paper we prove some results about 4-placement of rooted trees.

متن کامل

Properties of consensus methods for inferring species trees from gene trees.

Consensus methods provide a useful strategy for summarizing information from a collection of gene trees. An important application of consensus methods is to combine gene trees to estimate a species tree. To investigate the theoretical properties of consensus trees that would be obtained from large numbers of loci evolving according to a basic evolutionary model, we construct consensus trees fro...

متن کامل

Probabilities on cladograms: introduction to the alpha model

The alpha model, a parametrized family of probabilities on cladograms (rooted binary leaf labeled trees), is introduced. This model is Markovian self-similar, deletion-stable (sampling consistent), and passes through the Yule, Uniform and Comb models. An explicit formula is given to calculate the probability of any cladogram or tree shape under the alpha model. Sackin's and Colless' index are s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1302.0753 شماره

صفحات -

تاریخ انتشار 2013

Rooted Trees with Probabilities Revisited

نویسنده

چکیده

منابع مشابه

The probabilities of trees and cladograms under Ford's $\alpha$-model

Fair-balance paradox, star-tree paradox, and Bayesian phylogenetics.

4-PLACEMENT OF ROOTED TREES

Properties of consensus methods for inferring species trees from gene trees.

Probabilities on cladograms: introduction to the alpha model

عنوان ژورنال:

اشتراک گذاری